Abstract 

Experiments in the high-luminosity runs at the Large Hadron Collider face the challenges of 
very large pile-up. Primary techniques to deal with this are based on precise vertex and track 
reconstruction. Outside tracker acceptances, however, lie regions of interest for many aspects of 
the LHC physics program. We explore complementary approaches to pile-up treatment and propose 
a data-driven jet-mixing method which can be used outside tracker acceptances without depending 
on Monte Carlo generators. The method can be applied to treat correlation observables and take 
into account, besides the jet transverse momentum pedestal, effects of hard jets from pile-up. 
1. Introduction. Experiments at hadron colliders operating with very high luminosity face 
the challenge of pile-up, namely, a very large number of overlaid hadron-hadron collisions 
per bunch crossing. At the Large Hadron Collider (LHC), the pile-up in Run I data is about 
20 pp collisions on average, while it reaches the level of over 50 in Run II, and increases for 
higher-luminosity runs. In regions covered by tracking detectors, advanced vertexing 
techniques have been developed to deal with environments characterized by high pile-up. 
More generally, experiments rely on Monte Carlo simulations which include pile-up for 
comparisons with data. However, this introduces a significant model dependence, especially in 
regions where no detailed and precise measurements are available to constrain Monte Carlo 
modeling. 

In this paper we propose a different approach to treating high pile-up, with a view to 
employing data-driven methods rather than Monte Carlo methods. Our main focus is to 
deal with potentially large probabilities that jets with high transverse momenta are produced 
from pile-up events independent of the primary interaction vertex, in a region where tracking 
devices are not available to identify pile-up jets. A typical application would be Higgs 
production by vector boson fusion, where the associated jets may be produced outside the 
tracking detector acceptances. The issue we address is thus quite different from the issues 
that most of the existing methods for pile-up treatment are designed to deal with, which 
are the jet transverse momentum pedestal, due to the bias in the jet transverse momentum 
from added pile-up particles in the jet cone, and the clustering into jets of overlapping soft 
particles from pile-up. 

In what follows we will therefore use standard existing methods to treat soft particles and 
the jet pedestal, and devise new approaches to tackle the issue of misidentification which 
arises, in addition, in cases where precise tracking and vertexing are not feasible. The aim is 
to look for methods which treat pile-up without spoiling the physics of the signal process and 
which can be used outside the tracking detector acceptances without depending on Monte 
Carlo modeling. To this end, we suggest using minimum bias (or jet) samples recorded from 
data in high pile-up runs and applying event-mixing techniques to relate, via these data 
samples, the “true” signal to the signal measured in high pile-up. 

The approach does not address the question of a full detector simulation including pile-up. 
Rather, it focuses on how to extract physics signals with the least dependence on pile-up 
simulation, and how to use real data, rather than Monte Carlo events, at physics object 
level. 

The proposed method applies to the regime of high pile-up which is relevant for the LHC 
as well as for future high-luminosity colliders. It is designed to treat not only inclusive 
variables but also correlations. One of the features of the method is that it does not require 
data-taking in dedicated runs at low pile-up. Rather, the data required for event mixing are 
recorded at the same time as the signal events in high pile-up runs, so that there is no loss 
in luminosity. 

We will illustrate the approach using Drell-Yan lepton pair production associated with 
jets as a case study. We discuss two main physical consequences of pile-up collisions: the bias 
in the jet transverse momentum due to pile-up particles in the jet cone, and the 
misidentification of high transverse momentum jets from independent pile-up events. The method 
is general and can straightforwardly be extended to a large variety of processes affected by 
pile-up. 

2. Drell-Yan plus jets at high pile-up as a case study. Let us consider the associated 
production of a Drell-Yan lepton pair via Z-boson exchange and a jet. We take the jet 
transverse momentum and rapidity to be $p_T^{(\mathrm{jet})} > 30$ GeV, $|\eta^{(\mathrm{jet})}| < \ldots$, and the boson 
invariant mass and rapidity to be 60 GeV < $m^{(\mathrm{boson})}$ < 120 GeV, $|\eta^{(\mathrm{boson})}| < 2$. Event 
samples are generated by Pythia 8 [12] with the 4C tune [13] for the different scenarios 
of zero pile-up and $N_{\mathrm{PU}}$ additional pp collisions at $\sqrt{s} = 13$ TeV. We reconstruct jets with 
the anti-$k_T$ algorithm [13] with distance parameter R = 0.5. Results for the spectrum in 
the transverse momentum $p_T$ of the Z-boson, for Z + jet events, are shown in Fig. 1 for 
$N_{\mathrm{PU}} = 0$, $N_{\mathrm{PU}} = 20$ and $N_{\mathrm{PU}} = 50$. For comparison we also show the inclusive $N_{\mathrm{PU}} = 0$ 
Z-boson spectrum. 

We see from Fig. 1 that the effects of pile-up on Z-boson + jet scenarios are large. 
As a result of pile-up the shape of the $p_T$ spectrum is changed and the peak is shifted 
to lower values. This can be interpreted by noting that, as the Z + jet event sample 
becomes dominated by pile-up collisions, even with the jet transverse momentum selection 
cut $p_T^{(\mathrm{jet})} > 30$ GeV the Z-boson $p_T$ distribution in boson + jet events will tend to approach 
the inclusive Drell-Yan spectrum, given by the solid green curve. 

FIG. 1. Effect of pile-up on the Z-boson transverse momentum $p_T$ in Z-boson + jet production at 
the LHC. 

More precisely, we can identify two main implications of pile-up collisions: a large bias 
in the jet transverse momentum due to added pile-up particles in the jet cone leading to a 
jet pedestal, and a large probability that jets with high transverse momentum come from 
independent pile-up events. 

Several methods exist to deal with the jet $p_T$ pedestal. These include techniques based 
on the jet vertex fraction [3] and charged hadron subtraction, the PUPPI method [16], and 
the SoftKiller method [17]. These methods correct for transverse momenta of individual 
particles, but not for any mistagging. The same holds for approaches inspired by jet 
substructure studies, such as jet cleansing [18]. In Fig. 2 we apply SoftKiller [17], a new 
event-wide particle-level pile-up removal method, which can also be used with calorimeter 
information only. We present results at zero pile-up ($N_{\mathrm{PU}} = 0$), at pile-up $N_{\mathrm{PU}} = 50$, 
and at pile-up $N_{\mathrm{PU}} = 50$ with SoftKiller subtraction ($N_{\mathrm{PU}} = 50$ SK). 

FIG. 2. Application of SoftKiller to Z-boson + jet production. Left: (a) the leading jet $p_T$ 
spectrum; right: (b) the Z-boson $p_T$ spectrum. 

Fig. 2 illustrates different physical effects of pile-up in the leading jet spectrum and in 
the Z-boson spectrum. In Fig. 2(a) we compute the leading jet $p_T$ spectrum, and verify that 
SoftKiller efficiently removes the jet pedestal from pile-up: the zero pile-up jet spectrum 
(solid black curve) is shifted toward larger $p_T$ by pile-up collisions (dot-dashed black curve 
for $N_{\mathrm{PU}} = 50$) but the application of SoftKiller (dashed blue curve, $N_{\mathrm{PU}} = 50$ SK) corrects for 
this and restores the original signal to a very good approximation. In Fig. 2(b), on the other 
hand, we compute the Z-boson $p_T$ spectrum. The solid black curve is the zero pile-up result, 
the dot-dashed black curve is the $N_{\mathrm{PU}} = 50$ result, and the dashed blue curve is the result of 
applying SoftKiller. In the higher-$p_T$ part of the spectrum we observe that there is no need 
for any correction. In contrast, in the lower-$p_T$ part significant contributions are present 
from misidentified pile-up jets. These are not corrected for, and need to be properly treated, 
particularly in regions outside tracker acceptances where vertexing techniques cannot be 
relied on to identify pile-up jets [6]. We address this point next. 
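
To make the notion of an event-wide, particle-level removal concrete, the following is a simplified sketch of the patch-median rule as we understand it from the SoftKiller reference [17]: the rapidity-azimuth plane is divided into patches, the hardest particle in each patch is found, and the median of these values is used as a single transverse momentum threshold for the whole event. It is not the reference implementation. The function names, the grid extent `y_max`, the patch size `cell`, and the strict comparison at the threshold are assumptions made for illustration.

```python
import numpy as np

def soft_killer_pt_cut(pt, y, phi, y_max=4.0, cell=0.4):
    """Median over grid patches of the hardest-particle pT (empty patches count as 0).

    y_max and cell are illustrative assumptions, not values from the text.
    """
    pt, y, phi = (np.asarray(a, dtype=float) for a in (pt, y, phi))
    n_y = int(np.ceil(2.0 * y_max / cell))
    n_phi = int(np.ceil(2.0 * np.pi / cell))
    inside = np.abs(y) < y_max
    # Patch indices in rapidity and azimuth (azimuth wrapped into [0, 2*pi)).
    i_y = np.floor((y[inside] + y_max) / (2.0 * y_max) * n_y).astype(int)
    i_phi = np.floor((phi[inside] % (2.0 * np.pi)) / (2.0 * np.pi) * n_phi).astype(int)
    i_y = np.minimum(i_y, n_y - 1)        # guard against rounding at the upper edge
    i_phi = np.minimum(i_phi, n_phi - 1)
    patch_max = np.zeros((n_y, n_phi))
    np.maximum.at(patch_max, (i_y, i_phi), pt[inside])  # hardest pT per patch
    return float(np.median(patch_max))

def apply_soft_killer(pt, y, phi, **grid):
    """Boolean mask of particles strictly above the event-wide threshold."""
    pt = np.asarray(pt, dtype=float)
    return pt > soft_killer_pt_cut(pt, y, phi, **grid)
```

Because the threshold acts on individual particles, it reduces the pedestal under a jet but cannot tell whether a hard jet that survives belongs to the primary interaction or to an independent pile-up collision.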

3. Uncorrelated event samples and jet mixing. To treat effects beyond soft particles 
and the jet $p_T$ pedestal, we employ event mixing techniques [19-22] using uncorrelated 
samples. The main idea is that the signal in the pile-up scenario is obtained via mixing 
from the signal without pile-up and a minimum bias sample of data at high pile-up. Thus, 
to identify the contribution of the high-$p_T$ jets coming from independent pile-up events, 
we construct a signal plus pile-up scenario in a data-driven manner. We do this by adding 
physics objects from pile-up background to event samples before selection criteria are applied. 
The approach is designed to treat the region of high number $N_{\mathrm{PU}}$ of pile-up events, where 
$(N_{\mathrm{PU}} + 1)/N_{\mathrm{PU}} \approx 1$. 
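
The mixing step can be sketched as follows, under simplifying assumptions: the physics objects are taken to be jets that are already reconstructed and pedestal-corrected, each event is represented only by an array of jet transverse momenta, and the rapidity requirement is omitted. The function names and the event representation are hypothetical. The point of the sketch is the ordering: pile-up objects are overlaid first, and the jet selection is applied afterwards.

```python
import numpy as np

PT_MIN = 30.0  # GeV; jet transverse momentum threshold used in the case study

def mix_event(signal_jet_pt, minbias_jet_pt, n_pu, rng):
    """Overlay the jets of n_pu randomly drawn minimum bias events on one signal event.

    signal_jet_pt  : jet pT values of a zero pile-up signal event
    minbias_jet_pt : list of arrays, one array of jet pT values per minimum bias event
    """
    picks = rng.integers(0, len(minbias_jet_pt), size=n_pu)
    overlay = [np.asarray(minbias_jet_pt[i], dtype=float) for i in picks]
    return np.concatenate([np.asarray(signal_jet_pt, dtype=float), *overlay])

def has_selected_jet(jet_pt, pt_min=PT_MIN):
    """Selection applied after mixing: at least one jet above threshold."""
    return bool(np.any(np.asarray(jet_pt, dtype=float) > pt_min))
```

In this picture a Z-boson event whose own jets lie below threshold can still enter the Z + jet sample after mixing, because a hard jet from an independent pile-up collision satisfies the selection.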

We illustrate the method by taking a sample containing $N_{\mathrm{PU}}$ minimum bias events (which 
could be recorded data, but which for illustration we take to be Monte Carlo events), mixing this 
with the signal at zero pile-up, and then requiring a jet with $p_T^{(\mathrm{jet})} > 30$ GeV, $|\eta^{(\mathrm{jet})}| < \ldots$. 
We extract the unbiased signal without relying on Monte Carlo algorithms. 

FIG. 3. The Z-boson $p_T$ spectrum in Z + jet production from the jet mixing method. Left: (a) 
$N_{\mathrm{PU}} = 50$; right: (b) $N_{\mathrm{PU}} = 100$. 

Fig. 3 reports the result of carrying out this procedure, for $N_{\mathrm{PU}} = 50$ and $N_{\mathrm{PU}} = 100$. 
Here the solid black curve is the “true” Z-boson plus jet signal. The dashed blue curve is 
the high pile-up, SoftKiller-corrected result ($N_{\mathrm{PU}} = 50$ SK and $N_{\mathrm{PU}} = 100$ SK). As seen 
already in Fig. 2(b), this is far from the solid black curve in the lower-$p_T$ part of the spectrum. 
We regard the dashed blue curve as pseudodata in high pile-up. The long-dashed red curve 
is the jet mixing curve, obtained as described above by mixing the signal with the minimum 
bias sample. The result of the mixing method is then given as the solid red curve by a 
simple “unfolding”, defined by multiplying the signal by the ratio of the pile-up (dashed 
blue) curve to the mixing (long-dashed red) curve. We see that without appealing to any 
Monte Carlo method the true signal is extracted nearly perfectly from the mixed sample. 

FIG. 4. Maximal value of the relative deviation between the pile-up corrected signal and the true 
signal, with and without jet mixing, as a function of the number of pile-up collisions. Black dots: 
SoftKiller corrected result; open circles: jet mixing method applied. 

In addition to the closure test carried out above, we have checked the model dependence 
by applying the mixing procedure to different starting distributions, and verified that in this 
case as well the unfolding returns the true signal. 
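
The “unfolding” used here is a bin-by-bin rescaling, which can be sketched as follows. The sketch assumes the three spectra are histograms with identical binning; the argument names and the treatment of empty bins (ratio set to one) are assumptions made for illustration.

```python
import numpy as np

def unfold(start, mixed, measured):
    """Bin-by-bin unfolding: start * (measured / mixed).

    start    : starting signal distribution at zero pile-up
    mixed    : the same distribution after jet mixing and selection
    measured : high pile-up (pseudo)data after pedestal correction
    Bins where `mixed` is empty are left unchanged (assumption).
    """
    start = np.asarray(start, dtype=float)
    mixed = np.asarray(mixed, dtype=float)
    measured = np.asarray(measured, dtype=float)
    ratio = np.divide(measured, mixed, out=np.ones_like(measured), where=mixed > 0)
    return start * ratio
```

When the starting distribution coincides with the true signal, the mixed and measured spectra agree up to statistical fluctuations and the ratio is close to one in every bin, which is the closure test. For a different starting distribution the ratio carries the correction toward the true signal.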

In Fig. 4 we plot the maximal value of the relative deviation between the pile-up corrected 
signal and the true signal ((corrected - true)/true), for the SoftKiller case without jet mixing 
(black dots) and for the case with the jet mixing method applied (open circles), as a function 
of the number of pile-up collisions $N_{\mathrm{PU}}$. We see that, while in the SoftKiller case, in which the 
jet pedestal is removed, the deviation from the true signal becomes larger as $N_{\mathrm{PU}}$ increases, 
the deviation does not increase with $N_{\mathrm{PU}}$ once the jet mixing method is applied to take 
account of the hard jets from pile-up. 
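
A figure of merit of this kind can be computed from two binned spectra as in the following sketch. Two choices here are assumptions: the maximum is taken over the absolute value of the relative deviation, and bins in which the true signal is empty are skipped.

```python
import numpy as np

def max_relative_deviation(corrected, true):
    """Largest |(corrected - true) / true| over bins with a non-empty true signal."""
    corrected = np.asarray(corrected, dtype=float)
    true = np.asarray(true, dtype=float)
    populated = true > 0  # skip empty bins (assumption)
    rel = (corrected[populated] - true[populated]) / true[populated]
    return float(np.max(np.abs(rel)))
```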
FIG. 5. Effects on jet resolution. Left: (a) the parton-jet $p_T$ correlation; right: (b) the $\Delta R$ 
distribution. 


The main advantages of this approach are that it can be used with data recorded in 
high pile-up, and it does not depend on Monte Carlo algorithms for pile-up correction. In 
addition, it is interesting to perform control checks by examining results for the jet resolution 
which we obtain from the jet mixing method. These are shown in Fig. 5. Fig. 5(a) reports the 
parton-jet $p_T$ correlation, and Fig. 5(b) the distribution in $\Delta R = \sqrt{\Delta\phi^2 + \Delta\eta^2}$, where $\Delta\phi$ 
and $\Delta\eta$ are respectively the separation in azimuth and rapidity. We see that the features of 
the “true” signal are well reproduced. 
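
For reference, the separation $\Delta R$ can be evaluated as in this short sketch. The function names are hypothetical; the one detail that needs care is wrapping the azimuthal difference into $[-\pi, \pi)$ before combining it with the rapidity difference.

```python
import math

def delta_phi(phi1, phi2):
    """Azimuthal separation wrapped into [-pi, pi)."""
    return (phi1 - phi2 + math.pi) % (2.0 * math.pi) - math.pi

def delta_r(eta1, phi1, eta2, phi2):
    """Delta R = sqrt(delta_phi^2 + delta_eta^2)."""
    return math.hypot(eta1 - eta2, delta_phi(phi1, phi2))
```

Without the wrapping, two objects on either side of the $\phi = 0$ line would be assigned a separation close to $2\pi$ instead of the small true value.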

4. Conclusions. Current methods to deal with pile-up at the LHC employ precise vertex 
and track reconstruction, in regions where these are available, and in general rely on Monte 
Carlo simulations to model pile-up for data comparisons. The use of Monte Carlo event 
generators brings in a significant model dependence particularly in regions where these are 
not well constrained by measurements. 

In this paper we have discussed a different, data-driven approach to the treatment of 
pile-up, which makes use of minimum bias or jet samples recorded from data taken in high 
pile-up runs, constructs mixing methods to extract the signal process, and thus circumvents 
the model dependence implied by the use of Monte Carlo generators. 

The methodology is general, and can be applied in measurements to restore correlations 
between final-state particles. In such measurements two important kinds of pile-up effects 
are present, exemplified in the case of Z-boson plus jets which we have used for illustration: 
the jet $p_T$ pedestal and the misidentification of high-$p_T$ jets from independent pile-up events. 
While several methods exist to correct for the first effect (as well as for the related effect 
of the clustering into jets of overlapping soft particles), the second effect is not treated at 
present. We have proposed a jet-mixing method which treats this, and we have shown that 
it allows one to successfully extract the signal process from the mixed sample to within a few 
percent. 

The methods discussed in this paper can be applied to the high pile-up regime and do not 
require special runs at low pile-up. The data samples needed for jet mixing are recorded at 
the same time as the signal events. There is therefore no loss in luminosity. The advantages 
are that one can access the proper pile-up distribution and there is no need for pile-up 
reweighting. 

The use of these methods thus implies good prospects both for precision Standard Model 
studies at moderate scales affected by pile-up, e.g. in Drell-Yan and Higgs production [23-25], 
and for searches for rare processes beyond the Standard Model in high pile-up regimes. 